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PLASTID-TARGETING PEPTIDE 

The present invention relates to the production 
of proteins of interest in plants, and in particular to the 
targeting thereof to the plastid compartment. 

Plasts are intracellular organelle of 
chlorophyll-containing plants (algae, mosses, and higher 
plants) . Several main types of plasts can be distinguished 
according to their pigment content and the nature thereof: 
amyloplasts, which are rich in starch, chloroplasts in which 
the main pigments are chlorophylls, and chromoplasts for 
which the main pigments are keratonoids. These three 
categories of plasts which derive from common precursors, 
proplasts, have a common basic structure consisting of a 
double membrane enclosing the plastid stroma. In 
chloroplasts, there is a third membrane system forming, 
within the stroma, saccules called thylakoids . 

Besides their essential role in photosynthesis, 
chloroplasts are also involved in redox reactions, for 
example the reduction of nitrates to ammonium. Plasts also 
play an essential role in the biosynthesis and/or the storage 
of many molecules, among which mention will be made of 
starch, lipids, carotenoids, most amino acids, plant hormones 
(abscissic acid, precursors of gibberellins , jasmonate, 
etc . ) . 

Although plasts have their own genome encoding 
some of their proteins, a large number of the enzymes 
involved in the various plastid functions are encoded by the 
nuclear genome and imported into the plasts. 

This importation is carried out via a specific 
mechanism, which has more particularly been studied in the 
case of chloroplasts (for review cf. CHEN and SCHNELL, Trends 
Cell Biol. 9, 222-227, 1999 ; KEEGSTRA and CLINE, The Plant 
Cell 11, 557-570, 1999 ; SCHLEIFF and SOLL, Planta 211, 449- 
456, 2000 ; JACKSON- CONSTAN and KEEGSTRA, Plant Physiol. 125, 
1567-1676, 2001). This mechanism involves an import system in 
each of the two plastid membranes: in the outer membrane, the 
Toe (translocon at outer membrane of chloroplast) complex 
which comprises at least three proteins: Toe 86, 75 and 34 
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(KESSLER et al . , Science 266, 1035-1039, 1994 ; PERRY and 
KEEGSTRA, Plant Cell 6, 93-105, 1994); in the inner membrane, 
the Tic (translocon at inner membrane of chloroplast) complex 
which comprises at least four proteins: Tic 110, 55, 22 and 
5 20 (KESSLER and BLOBEL, Proc. Natl. Acad. Sci . 93, 7684-7689, 
1996; LUBECK et al . , EMBO J. 15, 4230-4238, 1996 ; CALIEBE et 
al., EMBO J. 16, 7342-7350, 1997; KOURANOV et al . , J. Cell 
Biol., 143, 991-1002, 1998), and also a chaperone protein in 
the stroma: ClpC (AKITA et al., J. Cell Biol. 136, 983-994, 

10 1997; NIELSEN et al . , EMBO J. 16, 935-946, 1997). 

A major element of this mechanism is Toc7 5, which 
is the most abundant protein in the outer membrane, and forms 
the central pore of the translocation channel located in this 
membrane (SCHNELL et al . , Science 266, 1007-1012, 1994; 

15 TRANEL et al . , EMBO J. 14, 2436-2446, 1995). Toc75 interacts 
■- specifically with a particular sequence, called "targeting 
peptide" or "transit peptide", located at the N-terminal end 
of the proteins imported into the plasts (MA et al . , J. Cell 
Biol. 134, 315-327, 1996). 

2 0 Many targeting peptides have been identified in 

the precursors of proteins targeted to the intermembrane 
space, the inner membrane, the stroma and, in the case of 
chloroplasts , to the thylakoid membrane. 

Among the proteins known to have a cleavable 

25 intraplastid- targeting peptide, mention will in particular be 
made of proteins targeted to the intermembrane space (Tic22: 
KOURANOV et al., 1998, mentioned above; KOURANOV et al . , J. 
Biol. Chem. 274, 25181-25194, 1999), proteins targeted to the 
inner membrane (TPT (Triose-Pi/Pi translocator) : BRINK et al . , 

30 J. Biol. Chem. 270, 20808-20815, 1995), proteins targeted to 
the stroma (ribulose-1 , 5-bisphosphate carboxylase (Rubisco) 
small subunit) : DE CASTRO SILVA FILHO et al . , Plant Mol . 
Biol. 30, 769-780, 1996; carbonic anhydrase) , proteins 
targeted to the thylakoid membrane (LHCP (light harvesting 

35 complex): LAMPPA et al . , J. Biol. Chem. 263, 14996-14999, 
1988; Cfo-II: ATPase subunit) and to the thylakoid lumen 
(OEE1 (Oxygen Evolving Element 1) : KO and CASHMORE, EMBO J. 
8, 3187-3194, 1989;. 
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These targeting peptides generally comprise 
between 40 and 100 amino acids, and most of them have common 
characteristics: they are virtually devoid of negatively 
charged amino acids, such as aspartic acid, glutamic acid, 
5 asparagine or glutamine; their N- terminal region is devoid of 
charged amino acids, and of amino acids such as glycine or 
proline; their central region contains a very high proportion 
of basic or hydroxylated amino acids, such as serine or 
threonine; their C- terminal region is rich in arginine and 
10 has the ability to form an amphipathic, beta-sheet secondary 
structure . 

In the case of proteins targeted to the thylakoid 
lumen, the targeting peptide is bipartite and comprises 
additional information for crossing the thylakoid membrane 

15 (DE BOER and WEISBEEK, Biochim. Biophys . Acta. 1071, 221-253, 
1991) . In certain cases, this bipartite targeting peptide can 
also be found in proteins targeted to the thylakoid membrane 
(KARNAUCHOV et al . , J. Biol. Chem. 269, 32871-32878, 1994). 

In all cases, the targeting peptide is cleaved 

20 after importation. This cleavage is carried out by specific 
proteases; a protease located in the stroma (VANDERVERE et 
al., Proc. Natl. Acad. Sci . 92, 7177-7181, 1995), and a 
protease located in the lumen of the thylakoid (CHAAL et al . , 
J. Biol. Chem. 273, 689-692, 1998) have been described. 

2 5 Proteins targeted to the outer membrane do not 

generally comprise a cleavable signal peptide; the targeting 
information is contained in the mature protein (CLINE and 
HENRY, Annu. Rev. Cell Dev. Biol., 12, 1-26, 1996); after 
they have been synthesized in the cytosol, these proteins are 

3 0 directly incorporated into the membrane (VAN'T HOF et al, 

FEBS lett. 291, 350-354, 1991 and J. Biol. Chem. 268, 4037- 
4042, 1993; PINADUWAGE and BRUCE, J. Biol. Chem. 271, 32907- 
32915, 1996) by means of interactions, the nature of which 
remains poorly understood, with the lipid bilayer. The only 
3 5 known exception to date concerns the Toc7 5 protein or 
(OEP75), the targeting of which to the outer membrane 
requires the presence of a cleavable, bipartite N- terminal 
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targeting peptide (TRANEL et al . , 1995, mentioned above; 
TRANEL and KEEGSTRA, Plant Cell 8, 2093-2104, 1996). 

It is known that the use of plast- targeting 
peptides is necessary for introducing into these plasts 
5 proteins of interest for acting on various plastid functions, 
in particular with the aim of improving the characteristics 
of plants of agronomic interest, for example the biosynthesis 
of lipids, of starch, of vitamins, of hormones or of proteins 
by said plants, or their resistance to diseases, to insects 

10 or to herbicides. For example, application EP 189707 proposes 
the use of cleavable targeting peptides derived from 
chloroplast protein precursors, and in particular of the 
ribulose-1 , 5-bisphosphate carboxylase small subunit targeting 
peptide, for importing a protein of interest into 

15 chloroplasts ; PCT application WO 00/12732 proposes the use of 
- targeting peptides from various plastid proteins, for 
importing proteins of interest into plasts. 

The plastid functions can be modified in this 
way, and the characteristics conferred by these modifications 

2 0 are very diverse. 

For the purposes of nonlimiting illustration, 
mention may be made of: 

an increase in herbicide resistance, by 
expression of the precursor of acetolactate synthetase (ALS) , 

25 (LEE et al. EMBO J., 7, 1241-1248, 1988), of mutated 
acetolactate synthetase (PRESTON and POWLES, Heredity 88, 
8-13, 2002); CHONG and CHOI, Biochem. Biophys . Res. Commun. 
279, 462-467, 2000), or of 3-enolpyruvylshikimate-5-phosphate 
synthetase (EPSP synthetase) (KLEE et al . , Mol . Gen. Genet., 

30 210, 437-442, 1987); 

- an increase in resistance to various stresses, 
by expression of zeaxanthin epoxidase, (SEO et al., Trends 
Plant Sci., 7, 41-48, 2002), of choline monooxygenase (SHEN 
et al., Sheng Wu Gong Cheng Xue Bao, 17, 1-6, 2001), of the 

35 product of the ERD1_ARATH gene (KIYOSUE et al . , Biochem. 
Biophys. Res. Commun., 15, 196, 1214-1220, 1993), of 
ferrochelatase (CHOW et al . , J. Biol. Chem. , 31, 272, 27565- 
27571, 1997), of omega -3 fatty acid desaturase (IBA et al . , 
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Tanpakushitsu Kakusan Koso, 39, 2803-2813, 1994; MURAKAMI et 
al.; Science 21, 287(5452), 476-479, 2000), or glutamine 
synthetase (FUENTES et al . , J. Exp. Bot . , 52, 1071-1081, 
2001) ; 

5 - modification of plast metabolism, so as to 

increase the capture of light energy (GAUBIER et al . , Mol . 
Gen. Genet., 1, 249, 58-64, 1995), ^ the photosynthetic and 
growth capacities (MIYAGAWA et al . , Nature Biotech., 19, 965- 
969, 2001), the carotenoid content (HUGUENEY et al . , Eur. J. 

10 Biochem. , 1, 209, 399-407, 1992; MANN et al . , Nature 
Biotech., 18, 888-892, 2000), or the content of various 
substances of interest, such as starch (PCT application 
WO 00/11144), essential amino acids (MUEHLBAUER et al . , Plant 
r Physiol., 106, 1303-1312, 1994), provitamin A (ROMER et al . , 

15 Nature Biotech., 18, 666-669, 2000), hormones (JOYARD et al . , 
> Plant Physiol. 118, 715-723, 1998), etc.; 

overexpression and chloroplast- targeting of 
proteins that can be used for the purposes of bioremediation 
(detoxification or depollution of contaminated soils) , such 

20 as ferritin, (LOBREAUX et al . , Biochem. J., 15, 288 (Pt 3), 
931-939, 1992), proteins of the phytochelatin family (CAZALE 
and CLEMENS, FEBS 507, 215-219, 2001 ; TSUJI et al . , BBRC 
293, 653-659, 2002), etc. 

All the intraplastid-targeting peptides known in 

25 the prior art make it possible to import a protein into 
plasts by means of the TOC and TIC membrane import systems, 
as indicated above. It has been noted that the use of these 
peptides for targeting proteins of interest into chloroplasts 
may, in particular when the targeting peptide/protein of 

3 0 interest construct is placed under control of a strong 
promoter such as the 3 5S promoter, has the drawback of 
saturating these import systems, by competing with the 
proteins naturally targeted to the chloroplast. As a result 
of this, "leakages" occur, resulting, after a few days, in 

35 the presence of the protein of interest in other subcellular 
compartments such as the cytoplasm. 

It would be desirable to have intraplastid- 
targeting peptides which would not depend on the TOC/TIC 
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import system and would therefore make it possible to avoid 
the abovementioned drawbacks. 

In previous studies aimed at identifying, by 
means of a proteomic approach, proteins from spinach 
5 chloroplast membrane preparations, the inventors identified, 
inter alia, peptides having considerable sequence similarity 
with a putative 41 kDa protein from Arabidopsis (TrEMBL 
accession number Q9SV68) ( SEIGNEURIN-BERNY et al . , Plant. J. 
19, 217-228, 1999; FERRO et al . , Electrophoresis 21, 3517- 
10 3526, 2000) . 

In continuing their studies in order to more 
fully characterize the Arabidopsis protein, and its homologue 
in spinach, the inventors noted that, surprisingly, although 
this involves proteins synthesized in the cytoplasm and 

15 imported at the inner membrane of the chloroplast, the 
- importation thereof was carried out without cleavage of a 
targeting peptide; in addition, sequence analysis of these 
proteins reveals no sequence having the characteristics of 
known plastid- targeting peptides. 

2 0 The proteins of the family represented by the 41 

kDa Arabidopsis protein, and the homologue spinach protein, 
are referred to hereinafter as IE41 (IE for Inner Envelope 
according to the nomenclature conventionally used for this 
membrane system) . 

2 5 The sequence of the Arabidopsis IE41 protein is 

represented in the sequence listing in the appendix under the 
number SEQ ID NO: 1; the sequence of the cDNA encoding the 
spinach IE41 protein is represented in the sequence listing 
in the appendix under the number SEQ ID NO: 2, and the 

3 0 corresponding polypeptide sequence is represented in the 

sequence listing in the appendix under the number SEQ ID NO: 
3. 

The inventors investigated which regions of the 
IE41 proteins were involved in their plastid targeting, and 
35 identified a region of 41 amino acids (residues at 60 to 100) 
that was essential to this targeting. 

The sequence of this region is represented in the 
sequence listing in the appendix under the number SEQ ID NO: 
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4 for the Arabidopsis IE41 protein, and under the number SEQ 
ID NO: 5 for the spinach IE41 protein. 

They also noted that, when fragments of IE41 
containing this region were fused to the N- terminal end of a 
5 heterologous protein, the recombinant protein resulting from 
this fusion was targeted to chloroplasts in a manner similar 
to the whole IE41 protein. 

A subject of the present invention is an 
intraplastid-targeting polypeptide, characterized in that it 
10 comprises : 

- a domain A consisting of a polypeptide having 
at least 60%, preferably at least 70%, advantageously at 
least 80%, and entirely preferably at least 90% identity, or 
at least 65%, preferably at least 75%, advantageously at 

15 least 85%, and entirely preferably at least 95% similarity, 
with one of the polypeptides SEQ ID NO : 4 or SEQ ID NO: 5; 
and at least one domain chosen from: 

- a domain B located at the N- terminal end of 
domain A, and consisting of a fragment of one of the 

2 0 polypeptides SEQ ID NO: 1 or SEQ ID NO: 3 comprising at least 

amino acids 49 to 59, preferably at least amino acids 39 to 
59, advantageously at least amino acids 29 to 59, entirely 
preferably at least amino acids 19 to 59, and particularly 
advantageously at least amino acids 9 to 59 of said 
25 polypeptide, or else of a polypeptide having at least 60%, 
preferably at least 70%, advantageously at least 80%, and 
entirely preferably at least 90% identity, or at least 65%, 
preferably at least 75%, advantageously at least 85%, and 
entirely preferably at least 95% similarity, with said 

3 0 fragment; 

- a domain C located at the C-terminal end of 
domain A, and consisting of a fragment of one of the 
polypeptides SEQ ID NO: 1 or SEQ ID NO : 3 comprising at least 
amino acids 101 to 111, preferably at least amino acids 101 

35 to 121, advantageously at least amino acids 101 to 131, 
entirely preferably at least amino acids 101 to 141, and 
particularly advantageously at least amino acids 101 to 151 
of said polypeptide, or else of a polypeptide having at least 
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60%, preferably at least 70%, advantageously at least 80%, 
and entirely preferably at least 90% identity, or at least 
65%, preferably at least 75%, advantageously at least 85%, 
and entirely preferably at least 95% similarity, with said 
fragment . 

The percentage identities or the percentage 
similarities mentioned here are determined by means of the 
BLASTp software (ALTSCHUL et al . , Nucleic Acids Res. 25, 
3389-3402, 1997), using the default parameters. 

Domains A, B and/or C defined above can come from 
the same IE41 protein; they can also come from IE41 proteins 
of different origins. 

A subject of the present invention is also any 
chimeric polypeptide resulting from the fusion of an 
intraplastid- targeting polypeptide in accordance with the 
invention with a heterologous polypeptide. Said heterologous 
polypeptide may be any polypeptide of interest that it is 
desired to introduce into plasts. Preferably, the 
intraplastid- targeting polypeptide in accordance with the 
invention is placed at the N-terminal end of the heterologous 
peptide. It could, however, also be placed within this 
peptide, or else at its C- terminal end. 

A subject of the present invention is also the 
use of an intraplastid- targeting polypeptide in accordance 
with the invention, for the importation of a protein of 
interest into plasts, and advantageously for the targeting of 
said protein to the inner plastid membrane. 

According to a preferred embodiment of the 
present invention, said intraplastid- targeting polypeptide is 
used for the importation of said protein of interest into 
chloroplasts . 

In particular, a subject of the present invention 
is a method for importing a protein of interest into plasts, 
characterized in that it comprises the expression, in a plant 
cell containing said plasts, of a chimeric polypeptide 
resulting from the fusion of an intraplastid- targeting 
polypeptide in accordance with the invention, with a 
heterologous polypeptide. 



A subject of the present invention is also: 

any polynucleotide encoding an intraplastid- 
targeting polypeptide or a chimeric polypeptide in accordance 
with the invention; 

- any recombinant expression cassette comprising 
a polynucleotide in accordance with the invention placed 
under the control of suitable sequences for regulating the 
transcription (in particular transcription promoter and 
terminator) ; 

any recombinant vector resulting from the 
insertion, into a suitable host vector, of a polynucleotide 
or of an expression cassette in accordance with the 
invention . 

A subject of the present invention is also host 
cells harboring a polynucleotide, an expression cassette or a 
recombinant vector in accordance with the invention. 

The present invention also encompasses 
transgenic plants genetically transformed with a 
polynucleotide or an expression cassette in accordance with 
the invention, and also the progenies of these plants. The 
invention also comprises the plant cells and tissues, and 
also the organs or parts of plants, including leaves, stems, 
roots, flowers, fruits and/or seeds obtained from these 
plants . 

Conventional techniques for constructing 
recombinant vectors, for transforming host cells or 
organisms, and for producing recombinant proteins can be used 
for implementing the present invention. 

The choice of the host vector and of the 
sequences for regulating the expression will be made in 
particular according to the method of transformation and to 
the host plant chosen, and/or to the type of cell or of 
tissue in which it is desired to obtain the expression. 

A very large number of promoters that can be used 
for the expression in plant cells are known in themselves. By 
way of examples, a constitutive promoter, such as the CaMV 
35S promoter or its derivatives, or the promoter of actin or 
ubiquitin, etc., may be chosen. An inducible promoter or else 
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a tissue-specific promoter may also be chosen, so as to 
effect the plastid- targeting of the protein of interest only 
at certain stages of the plant's development, under certain 
environmental conditions, or in certain target tissues. 

For example, if it is desired to preferentially 
obtain targeting of the protein of interest to chloroplasts , 
a chimeric polypeptide in accordance with the invention will 
be expressed under the control of a promoter specific for 
tissues or organs that are rich in plasts. By way of 
examples, the promoters of the chlorella virus that regulate 
expression of the adenine methyltransf erase gene (MITRA and 
HIGGINS, Plant Mol . Biol. 26, 85-93, 1994) or that of the 
cassava mosaic virus (VERDAGUER et al . , Plant Mol. Biol. 37, 
1055-1067, 1998) are expressed mainly in the green tissues. 
The regulatory elements of the promoter of the tomato 2A11 
gene allow specific expression in the fruit (VAN HAAREN and 
HOUCK, Plant Mol. Biol. 17, 615-630, 1991), etc. 

Methods for transforming plant cells or whole 
plants are well known in themselves: by way of nonlimiting 
examples, mention will be made of the transformation of 
protoplasts in the presence of polyethylene glycol, 
electroporation, the use of a particle gun, cytoplasmic or 
nuclear microinjection, or transformation by means of 
Agroba c t eri urn . 

The present invention can be implemented in the 
usual applications of plast- targeting peptides, and in 
particular in those mentioned above, so as to act on various 
plastid functions. This involves, in particular, the 
modification of functions specific to the inner membrane of 
the envelope, for example the biosynthesis of pigments, of 
quinones, of fatty acids, of vitamins and of plant hormone 
precursors, but also the importation of all the ions and 
metabolites into the plast. 

The characteristics of the plastid- targeting 
peptides in accordance with the invention, which are very 
different from those of the known plastid- targeting peptides, 
make it possible to suppose that the targeting peptides in 
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accordance with the invention use an import system that is 
different from that involving the TOC and TIC proteins. 

As a result of this, the proteins of interest 
targeted to plasts by means of a targeting peptide in 
5 accordance with the invention would not compete with the 
proteins naturally targeted to the plast by means of the TOC 
and TIC system, and would not saturate the latter. This would 
make it possible in particular to prevent leakages into the 
other subcellular compartments, and to conserve the proteins 
10 of interest in the chloroplast, even after several days of 
expression. This would also make it possible to target to the 
plasts proteins for which conventional importation by means 
of an N-terminal cleavable targeting sequence and using the 

- TIC/TOC system would not be functional. 

15 The present invention will be understood more 

- fully from the further description which will follow, which 
refers to nonlimiting examples illustrating the obtaining and 
characterization of plastid- targeting peptides in accordance 
with the invention and their use for the importation of 

2 0 heterologous proteins into chloroplast s . 

EXAMPLE Is CHARACTERIZATION AND CLONING OF THE ARABIDOPSIS 
THALIANA IE41 PROTEIN 

In prior studies aimed at identifying the most 
hydrophobic proteins of spinach chloroplast membrane 
25 preparations (SEIGNEURIN-BERNY et al . , Plant. J., 19, p. 217- 
228, 1999 ; FERRO et al . , Electrophoresis, 21, 3517-3526, 
2000) , several peptides derived from a 41 kDa protein 
inhibiting great sequence similarity with a putative protein 
from Arabidopsis (TrEMBL accession number Q9SV68) were 

3 0 demonstrated . 

However, analysis of the primary sequence of this 
41 kDa protein by means of the TMPred program (HOFMANN and 
STOFFEL, Biol. Chem. Hoppe-Seyler . , 347, 166, 1993), did not 
make it possible to detect transmembrane segments able to 
35 provide anchoring of the protein in a lipid bilayer. 

To confirm the location of this protein in the 
chloroplast envelope, the corresponding cDNA was cloned and 
the recombinant protein was overexpressed in E . coli in order 
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to obtain polyclonal antibodies directed against this 
protein . 

Expression in E. coli 

The cDNA encoding the 41 kDa Arabidopsis protein 
5 is obtained by PCR, from an Arabidopsis cDNA library, using 
the following primers: 

TCA CATATG GCTGGAAAACTCAATGCAC (SEQ ID NO : 10) 

which makes it possible to introduce an Ndel 
restriction site (underlined) at the 5' end of the cDNA; 
10 ATG GATCCA ACGCTCTTATGGCTCGAC (SEQ ID NO : 11) 

which makes it possible to introduce a BamHI 
restriction site (underlined) at the 3' end of the cDNA . 

The amplification fragment is cloned into the 
plasmid pBluescript KS~ . The insert is then digested with the 
15 Ndel and BamHI restriction enzymes, and inserted into the 
expression vector pET-15b (NOVAGEN) . 

The resulting vector allows the expression of a 
recombinant protein having a polyhistidine extension (His- 
tag) at its N-terminal end [ (His-tag) -P41 ] . 
20 This vector is used for transforming E. coli 

strain BLR (DE3 ) bacteria. 

The recombinant bacteria are cultured in 500 ml 
of LB medium containing 100 (ag/ml of ampicillin at 37°C. When 
the optical density at 600 nm (D0 6 oo) of the E. coli culture 
25 reaches 0.6, IPTG (isopropyl-p-D-galactothiopyranoside) is 
added at a final concentration of 1 niM so as to induce 
expression of the 41 kDa protein. 

After 3 hours of culture, the cells are 
centrifuged for 2 min at 13 000 rpm (EPPENDORF 5415D) . 
30 The pellet is resuspended in 20 ml of lysis 

buffer (50 mM NaH 2 P0 4 , 300 mM Na CI, 10 mM imidazole, pH 8) 
and the bacteria are lysed by sonication (OMRON sonicator, 
type STP. YM. 220 .VAC, 6x1 min, 0°C) . 

After sonication, the total protein extract is 
35 analyzed by SDS 12% PAGE. The gels are stained with Coomassie 
blue (R-250, BIORAD) . 

The results are given in figure 1A: 
Legend of figure 1A: 
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pET-15b = bacteria transformed with the nonrecombinant 
plasmid (negative control) 

pET-15b + insert = bacteria transformed with the recombinant 
plasmid, 
5 - = no induction with IPTG, 
+ = induction with IPTG, 

* = band corresponding to the recombinant 41 kDa protein. 

After induction with IPTG, the recombinant 
protein is strongly expressed in the bacteria transformed 
10 with the plasmid pET-15b containing the Arabidopsis cDNA 
insert . 

Solubilization of the recombinant protein 

The bacterial pellet is suspended in 1 ml of 
20 mM Tris/HCl buffer, pH 6.8, and then lysed by sonication 

15 (OMRON sonicator, type STP . YM. 220 . VAC, 6x1 min, 0°C) . 

After sonication, a first centrif ugation (2 min, 
13 000 rpm) of the total protein extract makes it possible to 
isolate the soluble proteins in the supernatant. The 
insoluble proteins in the pellet are suspended in 1 ml of a 

20 buffer containing detergent (20 mM Tris/HCl pH 6 . 8 , 0.5% 
Triton X-100) . A second centrif ugation makes it possible to 
isolate the membrane proteins solubilized with Triton X-100. 
The nonsolubilized proteins are resuspended in 2 0 mM 
Tris/HCl, pH 6.8 and analyzed. The various fractions are 

25 analyzed by SDS-12% PAGE, as indicated above. 

The results are given in figure IB: 
Legend of figure IB: 

pET-15b ~ bacteria transformed with the nonrecombinant 
plasmid (negative control) , 
3 0 pET-15b + insert = bacteria transformed with the recombinant 
plasmid, 

S = soluble proteins, 

D = proteins solubilized in Triton X-100, 
I = proteins not solubilized in Triton X-100, 
3 5 * = band corresponding to the recombinant 41 kDa protein. 

It is noted that the recombinant protein (*) is 
found in 2 different fractions: a water-soluble fraction 
present in the E. coli cytosol and an insoluble fraction, 
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which is not solubilized by Triton X-100, which indicates 
that it is probably aggregated in the form of inclusion 
bodies . 

Purification of the recombinant protein 

5 The soluble fraction of the recombinant (His- 

tag)-P41 protein is purified by affinity chromatography. 
After centrifugation (10 min at 6000 rpm, EPPENDORF 5415D) , 
the supernatant is loaded onto a 2 . 5 ml "His-bind resin" 
affinity column (NOVAGEN) charged with 13 ml of charged 
10 buffer (50 mM NiS0 4 ) and equilibrated with 5 ml of 
equilibrating buffer (20 mM tris/HCl, pH 7.9, 5 mM imidazole, 
0 . 5 M NaCl) . 

The column is washed with 2 volumes containing 

3 5 mM imidazole, and 2 volumes of lysis buffer containing 
15 60 mM imidazole (L2) . The protein is eluted with 6 volumes of 

lysis buffer containing 250 mM imidazole. The various 

fractions are analyzed by SDS-12% PAGE and revealed with 

Coomassie blue. 

The results are given in figure 2. 
2 0 Legend of figure 2: 

C = bacterial pellet diluted in the lysis buffer (10 (il) , 

S = soluble bacterial proteins (10 |il), 

P = proteins not bound (10 jul ) , 

L, LI, L2 = washes with the lysis buffer (35 and 60 mM 
25 imidazole) (15 ^il ) , 

Elution = fraction eluted in the presence of 250 mM 
imidazole . 

Production of polyclonal antibodies 

The purified recombinant (His-tag) -P41 protein is 
30 desalified (SEPHADEX G25 column) and stored at -80°C. 

This recombinant protein is used for immunizing 
rabbits in order to produce polyclonal antibodies directed 
against the 41 kDa Arabidopsis protein. 

EXAMPLE 2 : LOCATION OF THE 41 KDA PROTEIN IN CHLOROPLASTS 

35 During the purification procedure, the 41 kDa 

protein behaves like a protein that is water soluble and 
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slightly hydrophobic, which raises the question of its 
effective association with the chloroplast envelope. 

Its subcellular location was therefore verified 
by analyzing various chloroplast fractions. 
5 Crude chloroplasts are obtained from 3 - 4 kg of 

spinach (Spinacia oleracea L. ) leaves and are purified by 
isopycnic centrif ugation on a Percoll gradient (DOUCE and 
JOYARD, Methods in chloroplast Molecular Biology. Edelman, 
M. , Hallick, R. and Chua, N.-H., eds . (Amsterdam : Elsevier 

10 Science Publishers BV) , 239-256, 1982). At this stage, 
protease inhibitors (1 mM PMSF, 1 mM benzamidine and 0,5 mM 
aminocaproic acid) are added in order to prevent any protein 
degradation. The purified chloroplasts are lysed in a 
hypotonic medium, and the envelope membranes are purified 

15 from the lysate by centrif ugation on a sucrose gradient 
(DOUCE and JOYARD, 1982, mentioned above) . Envelope 
subtractions respectively enriched in outer and inner 
membranes are obtained according to the protocol described by 
BLOCK et al. (J. Biol. Chem. , 258, 13273-13280, 1983). 

2 0 All the above steps are carried out at between 0 

and 5°C. The fractions obtained are stored in liquid nitrogen 
in 50 mM MOPS-NaOH, pH 7.8, in the presence of protease 
inhibitors (1 mM benzamidine and 0.5 mM aminocaproic acid). 
Analysis of the chloroplast fractions 

25 The SDS-PAGE analyses of the total chloroplasts, 

or of their envelope membrane, stroma or thylakoid membrane 
fractions (15 fig) , and also of the chloroplast envelope 
subfractions (15 jig) , are carried out as described by CHUA 
(Methods Enzymol . , 69, 434-436, 1980). The resolving and 

30 stacking gels (12-15% acrylamide) , like the migration buffer, 
contain 0.1% of SDS . The peptides are revealed either with 
Coomassie blue (MANIATIS et al . , Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, 1982) or with silver 
nitrate (MERRIL et al . , Anal. Biochem. , 110, 201-207, 1981). 

3 5 For the Western blotting analyses, the 41 kDa 

protein is detected using the polyclonal antibodies directed 
against the recombinant Arabidopsis protein produced as 
described in example 1, labeled with alkaline phosphatase. 
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The results are given in figure 3A. 
Legend of figure 3A: 
C = chloroplast proteins, 
E = envelope membrane proteins, 
5 S = stroma proteins, 

T = thylakoid membrane proteins . 

These results show that the 41 kDa protein is 
associated only with the chloroplast envelope and is not 
detected in the stroma or in the thylakoids . 

10 The purified intact chloroplasts are devoid of 

cytosolic proteins which can contaminate the preparation 
(DOUCE and JOYARD, 1980, mentioned above) . However, the 
41 kDa protein could interact specifically with the outer 
membrane of the chloroplast envelope and could thus be co- 

15 purified with the envelope preparations. In order to exclude 
this possibility, 20 ng of envelope derived from intact 
chloroplasts are treated with Bacillus thermoproteolyticus 
thermolysin (0, 20, 50 and 100 jig/ml) in order to digest the 
polypeptides located on the outer surface of the outer 

20 membrane of the envelope (JOYARD et al . , J. Biol. Chem. , 258, 
10 000-10 006, 1983). As a control, solubilized envelope 
proteins are subjected to the same proteolytic treatment. 

The presence of the 41 kDa protein is detected by 
Western blotting, as described above. 

2 5 The results are given in figures 3B and 3C. 

Legend of figures 3B and 3C : 
0 = absence of thermolysin, 
2 0 = thermolysin at 2 0 jag/ml, 
5 0 = thermolysin at 5 0 |ig/ml, 

30 100 = thermolysin at 100 |ig/ml . 

The 41 kDa protein is not affected by the 
treatment with thermolysin (figure 3B) , whereas the same 
proteolytic treatment carried out on solubilized envelope 
proteins shows the sensitivity of the 41 kDa protein to the 

35 thermolysin treatment (figure 3C) . This result excludes the 
hypothesis that the 41 kDa protein is located on the outer 
face of the outer membrane. In fact, the 41 kDa protein is 
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not a cytosolic protein contaminating the chloroplast 
envelope preparations. 

Envelope subfractions respectively enriched in 
outer and inner membranes are used to specify the sub- 
5 location of the 41 kDa protein at the level of the membrane 
of the chloroplast envelope. The nature of these envelope 
subfractions was confirmed using the markers IE18 and OEP24, 
which are respectively intrinsic proteins of the inner and 
outer membrane of the chloroplast envelope. The IE18 and 
10 OEP24 proteins are detected using polyclonal antibodies 
directed specifically against each of these proteins. 
The results are given in figure 3D. 
Legend of figure 3D : 
■ E = envelope membrane proteins, 
15 OM - outer membrane proteins, 
- IM = inner membrane proteins . 

The 41 kDa protein is found, like the IE18 
protein, only in the preparations of chloroplast envelopes 
enriched in inner membrane. 
2 0 All the results given in figures 3A - 3D show 

that the 41 kDa protein is located at the level of the inner 
membrane of the chloroplast envelope. 

According to conventional nomenclature, this 
inner envelope protein, which has a theoretical mass by 
25 polyacrylamide gel electrophoresis of 41 kDa, is called IE41 
(for "Inner Envelope Protein of 41 kDa"). 

Analysis of the interactions between the IE41 protein and the 
inner membrane of the chloroplast envelope 

In order to analyze more precisely the mode of 
30 interaction of the IE41 protein with the chloroplasts inner 
membrane, various hypotheses were tested: 

1) The IE41 protein is a soluble protein located 
in the intermembrane space between the outer and inner 
membranes of the chloroplast envelope. This protein would be 
35 co-purified with the envelope preparations and sequested in 
the envelope vesicules. In this case, sonication of the 
envelope vesicules would make it possible to release IE41. 
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To test this first hypothesis, envelope proteins 
(500 |ag) are solubilized in 50 mM MOPS, pH 7.8 (500 jal) . The 
envelope vesicules are sonicated for 10 sec and then 
centrifuged (20 min at 72 000 g, Beckman L2 65B, SW28 rotor) 
5 in order to separate the soluble proteins and the membrane 
proteins. Each fraction (20 fal) is analyzed by SDS-12% PAGE 
(visualization: Coomassie blue) and Western blotting. 

The results are given in figure 4A. 
Legend of figure 4A: 
10 - = no sonication, 
+ = sonication, 
S = soluble proteins, 
I = membrane proteins . 

The analyses by SDS-PAGE and Western blotting of 
15 the treated envelope fractions show that the IE41 protein is 
not solubilized after sonication of the envelope vesicules. 
On the contrary, the major soluble proteins of the stroma 
(RbcL) which are sequested in the envelope vesicules, and 
which are known to contaminate the envelope fractions, are 
20 solubilized by this treatment. This shows that the IE41 
protein is neither a soluble protein of the intermembrane 
space, nor a soluble protein of the stroma that contaminates 
the purified envelope fraction. 

2) The IE41 protein may be bound to the inner 

25 membrane: 

- either by anchoring to the lipid bilayer or 
partial insertion into said bilayer; in this case, only the 
use of detergent can enable the IE41 protein to be 
solubilized; 

30 - by electrostatic interactions with one or more 

membrane proteins or the polar surface of the lipid bilayer; 
in this case, these interactions can be broken, and the IE41 
protein can be solubilized by means of an alkaline treatment 
or by means of high salt concentrations. 

3 5 In order to determine the type of interactions 

involved in the binding of IE41 with the inner membrane, the 
following experiments were carried out: 

a) solubilization with Triton X-100 : 



Envelope vesicules (0.8 mg) are diluted in 1 ml 
of 50 mM MOPS, pH 7.8, containing 0.05, 0.1 or 0.2% (v/v) of 
Triton X-100. After incubation for 30 min at 4°C, the mixture 
is centrifuged in order to separate the soluble proteins and 
the membrane proteins. All the fractions (20 (il) are analyzed 
by SDS-12% PAGE (visualization: Coomassie blue) and Western 
blotting . 

The results are given in figure 4C . 
Legend of figure 4C : 
Mix = purified envelope vesicules, 
M - envelope membrane proteins, 
S = envelope soluble proteins. 

These results show that the IE41 protein can be 
completely solubilized with concentrations of Triton X-100 
(0.2%) that are much lower than those (of the order of 2%) 
which are necessary for solubilizing the intrinsic proteins. 

b) solubilization by alkaline treatment or by 
salt treatment. 

Purified envelope vesicules (500 \xg) are 
incubated for 30 min at 4°C in various media (500 |il) : 

1) 10 mM MOPS, pH 7.8 + 0.5 M NaCl; 

2) 10 mM MOPS, pH 7.8 + 0.5 M KI ; 

3) 0.1 M Na 2 C0 3 , pH 11; 

4) 0.1 N NaOH, 

and then sonicated and centrifuged as described 
above in order to separate the soluble proteins and the 
membrane proteins . 

All the fractions are analyzed (20 jal ) by SDS-12% 
PAGE (visualization: Coomassie blue) and Western blotting 
(detection with the polyclonal antibodies directed against 
the Arabidopsis IE41 protein (1/5000 dilution) or directed 
against the IE18 protein (1/5000 dilution)). 

The results are given in figure 4B 
Legend of figure 4B : 
+ = sonication (10 sec) , 
NaCl 0.5 M = treatment 1, 
KI 0 . 5 M = treatment 2 , 
0.1 M Na 2 C0 3 , pH 11 = treatment 3, 
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0.1 N NaOH = treatment 4, 

S = soluble protein fraction, 

I = insoluble protein fraction. 

These results show that the IE41 protein is at 
5 least partly solubilized by the salt (KI, NaCl) or moderately 
alkaline (Na 2 C0 3 ) treatments, which have no effect on the 
intrinsic protein IE18, it only being possible for the latter 
to be solubilized by a strong alkaline treatment (NaOH) . 

All the results given in figures 4A, 4B and 4C 
10 indicate that the IE41 protein is an extrinsic protein, the 
binding of which to the inner membrane of the chloroplast 
envelope involves electrostatic interaction. 

EXAMPLE 3 : PURIFICATION AND CHARACTERIZATION OF THE SPINACH 
IE41 PROTEIN 

15 Surprisingly, the IE41 protein purified from 

spinach chloroplasts and the recombinant Arabidopsis protein 
have a similar size by SDS-PAGE and Western blotting, which 
suggest the possibility that IE41 may be targeted to the 
inner membrane of the envelope without requiring the cleavage 

2 0 of an N- terminal import sequence. 

To test this hypothesis, the 1E41 protein present 
in the envelope of spinach chloroplasts was purified in order 
to sequence it and to compare its sequence with that of the 
corresponding cDNA . 

2 5 Iimnunopurif ication of the spinach IE41 protein 

Chloroplast envelope proteins (1 mg) are 
solubilized in 1 ml of 50 inM Tris/HCl buffer containing 
150 mM NaCl and 6 mM CHAPS and centrifuged (20 min, 72 000 g, 
Beckman L2 65B, SW2 8 rotor) . The soluble proteins are 

30 incubated for 1 h at 4°C with 33 jal of polyclonal serum 
directed against the recombinant Arabidopsis IE41 protein. 
50 ^g of agarose-protein A (BOEHRINGER) are then added and 
the mixture is incubated for 3 h at 4°C. After 3 successive 
washes by centrif ugation (EPPENDORF 5415D, 12 000 rpm, 

35 20 min, 4°C) and resuspension of the pellet in 1 ml of 
solubilizing buffer (20 mM MOPS, 150 mM NaCl, 6 mM CHAPS, 
pH 7.8), an excess (50 jig) of recombinant protein (His-tag)- 
IE41, in 2 00 |il of solubilizing buffer, is added. The mixture 
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is incubated for 1 h at 4°C, and centrifuged for 20 min at 
12 000 rpm (EPPENDORF 5415D) . The supernatant is incubated 
for 1 h with Ni-NTA resin (QIAGEN) , pre-equilibrated in the 
solubilizing buffer, in order to eliminate most of the (His- 
5 tag)-IE41 recombinant protein. 

Each fraction (20 ^1 ) is analyzed by SDS-12% PAGE 
(visualization with silver nitrate) and Western blotting 
(using the rabbit anti-IE41 polyclonal antibodies described 
in example 1) . 
10 The results are given in figure 5. 

Legend of figure 5 : 
A: analysis by SDS-PAGE; 
B: Western blotting; 

Mix = solubilized envelope proteins; 
15 C = insoluble proteins ; 
„ S = soluble proteins; 

LI, L2 , L3 = fractions recovered in the course of the 3 
successive washes ; 

El = fraction eliminated by incubation with the Ni-NTA resin; 
20 E2 = purified spinach IE41 protein (So) + (His- tag) -IE41 . 

The E2 fraction comprising the purified natural 
spinach IE41 protein (So) remains contaminated by the 
recombinant Arabidopsis (His- tag) -IE41 protein. 

The difference in size between these two proteins 
25 corresponds to the polyhistidine extension (His-tag) present 
at the N- terminal end of the recombinant protein. 

EXAMPLE 4 : OBTAINING THE cDNA ENCODING THE SPINACH IE41 
PROTEIN 

Partial sequencing of the spinach IE41 protein 
30 eluted from the electrophoresis gel made it possible to 
obtain 9 different peptide sequences. These sequences were 
used to define degenerate primers which made it possible to 
isolate the cDNA encoding the IE41 protein. 

The complete sequence of this cDNA (SEQ ID No: 
35 2), and also the deduced polypeptide sequence (SEQ ID NO: 3), 
are given in figure 6. The translation initiation codon ATG 
is indicated in bold letters, and the stop codon TAA is 
underlined. The 9 peptide sequences obtained by direct 
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sequencing of the spinach IE41 protein are highlighted in 
gray. The correspondence between the peptides obtained with 
the sequence deduced from the cDNA of the spinach IE41 
protein, in particular in the N- terminal region, and also the 
5 presence of a stop codon downstream of the initiating 
methionine and in the same reading frame, demonstrate that 
the predicted cDNA is complete and that this protein does not 
undergo any post- translational maturation during its 
targeting to the inner membrane of the chloroplast envelope. 

10 The spinach IE41 protein has 75.1% identity and 

88.8% similarity with the Arabidopsis IE41 protein. This high 
similarity, and the fact that Arabidopsis contains only one 
ie42 gene per haploid genome, make it possible to conclude 
■- that these proteins are encoded by orthologous Arabidopsis 

15 and spinach genes. 

The Arabidopsis and spinach IE41 proteins were 
aligned with homologous proteins from a bacterium, from a 
yeast and from animals. 

The results are given in figure 7 . 

20 Arabidopsis thaliana: IE41 ATH (SEQ ID NO: 1), 
Spinach: IE41 SOL (SEQ ID NO : 3); 

homologous proteins : 
Esherichia coli: QORECOLI (SEQ ID NO: 6), 
Saccharomyces cerevisiae: QORYEAST (SEQ ID NO: 7) , 

25 Cavia Porcellus: QORCAVPO (SEQ ID NO: 8), 
Mouse: QORMOUSE (SEQ ID NO : 9). 

The residues conserved in the 6 peptide sequences are 
highlighted in dark gray. The residues conserved in the IE41 
sequence and in at least one other homologous protein 

3 0 sequence are highlighted in light gray. The similarities 
between residues are based on the following groups: ASPTG, 
ILMV, KRH, NQ, DE , YWF and C. 

The homology searches indicate that the IE41 
protein belongs to the dehydrogenase super family, and more 

35 particularly to the group of ^-crystalline- type quinone 
oxidoreductases (JORNVALL et al . , FEBS 3, 240-244, 1993). In 
addition, the sequence comparison between the IE41 proteins 
and the other proteins of the same family reveals that the 
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first 50 residues in the N-terminal region of these proteins 
are very conserved between bacteria, plants and animals. This 
observation suggests that this N-terminal region of the plant 
IE41 proteins is not involved in targeting into the 
5 chloroplast, and is more probably conserved due to the 
selection pressure exerted during evolution on the catalytic 
domain of the protein. 

EXAMPLE 5: ANALYSIS OF THE PLASTID- TARGETING OF THE 
ARABIDOPSIS IE41 PROTEIN IN ARABIDOPSIS AND TOBACCO CELLS 

10 In order to define the domain essential to the 

importation of the IE41 protein, various constructs encoding 
truncated forms of this protein, fused to GFP, are expressed 
in Arabidopsis and tobacco cells. 
Construction of the expression vectors; 

15 The plasmid [35Q-SGFP (S65T) ] used for these 

constructs, which comprises the sequence encoding GFP under 
the control of the 35S promoter, and also the plasmid [35f2- 
TP-sGFP (S65T) ] , which comprises the sequence encoding the 
targeting peptide (TP) of the ribulose-1 , 5-bisphosphate 

2 0 carboxylase small subunit, fused to the sequence encoding 
GFP, were previously described by CH1U et al . (Curr. Biol., 
6, 325-330, 1996) . 

The sequence encoding the Arabidopsis IE41 
protein is amplified by PGR using the following two primers: 

2 5 XhoI-N-ter CCT CTCGAGA TGGCTGGAAAACTCATGCAC (SEQ ID NO: 12), 

and 

NcoI-C-ter CAAC CCATGGA TGGCTCGACAATGATCTTC (SEQ ID NO : 13), 
which introduce, respectively, an Xhol site and an Ncol site 
(underlined) . 

3 0 The PCR product is cloned, blunt-ended, into the 

vector pBLUESCRIPT KS (STRATAGENE) . The Xhol -Ncol fragment 
■ cleaved from the plasmid thus obtained is inserted into the 
plasmid 35Q-SGFP (S65T) digested beforehand with Sall-Ncol , 
in order to create the vector 35Q-IE41-SGFP ( S65T) , 
3 5 comprising the coding region of the Arabidopsis IE41 protein, 
fused to GFP. A similar protocol is used for the other 
constructs : 



* The sequence encoding the Arabidopsis IE41 
protein lacking the first 31 amino acids is obtained by PCR 
amplification using the following two primers: 

Sall-N-ter CGGTT GTC G AC A TG AAG AGT AATG AGGTTTGC C TG (SEQ ID NO: 
14) 

NcoI-C-ter CAAC CCATGG ATGGCTCGACAATGATCTTC (SEQ ID NO : 13) . 

The plasmid 35Q-A(1-31) IE41-sGFP (S65T) is 
obtained by insertion of this sequence into the plasmid 3 5Q- 
sGFP(S65T) . 

* The sequence encoding the Arabidopsis IE41 
protein lacking the first 59 amino acids is amplified by PCR 
using the following two primers: 

Sall-N-ter GAATG GTCGAC ATGTTTCTGCCCCGCAAGTTC { SEQ ID NO : 15), 
and 

NcoI-C-ter CAAC CCATGGA TGGCTCGACAATGATCTTC (SEQ ID NO: 13) . 

The plasmid 35fi-A(l-59) IE41-sGFP (S65T) is 
obtained by insertion of this sequence into the plasmid 3 5Q- 
sGFP(S65T) . 

* The sequence encoding the Arabidopsis IE41 
protein lacking the first 99 amino acids is amplified by PCR 
using the following two primers: 

Sall-N-ter GGTTGTCGACATGCTAGGTGGAGGTGGACTTG (SEQ ID NO: 16) 
NcoI-C-ter CAAC CCATGG ATGGCTCGACAATGATCTTC (SEQ ID NO : 13). 

The plasmid 35Q-A(l-99) IE41-sGFP (S65T) is 
obtained by insertion of this sequence into the plasmid 3 5Q- 
sGFP(S65T) . 

* The sequence encoding the amino acids 6-100 of 
the Arabidopsis IE41 protein is amplified by PCR using the 
following two primers: 

XhoI-N-ter CCT CTCGAGA TGGCTGGAAAAACTCATGCAC (SEQ ID NO: 17) 
NcoI-C-ter AC CCATGG CTAGATGGCTAAGAACCGCTAC (SEQ ID NO: 18) . 

The primer SEQ ID NO: 17 comprises an additional 
nucleotide compared with the primer SEQ ID NO: 15, which 
creates a reading frame shift in the amplification product, 
the translation of which begins at the ATG codon 
corresponding to the methionine at position 6 of the IE41 
protein. 
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The plasmid 35Q- ( 6-100 ) IE41-sGFP ( S65T) is 
obtained by insertion of this sequence into the plasmid 3 5Q- 
sGFP(S65T) . 

* The sequence encoding amino acids 60-100 of the 
5 Arabidopsis IE41 protein is amplified by PCR using the 
following two primers: 

Sall-N-ter GAATG GTCGACA TGTTTCTGCCCCGCAAGTTC (SEQ ID NO: 15) 
NcoI-C-ter AC CCATGG CTAGATGGCTAAGAACCGCTAC (SEQ ID NO : 18) . 
The plasmid 35Q- ( 60-100 ) IE41-sGFP (S65T) is obtained by 
10 insertion of this sequence into the plasmid 35Q-sGFP ( S65T) . 

These various constructs are given in figure 8. 
Legend of figure 8 : 
IE41=plasmid 3 5Q-IE41-sGFP ( S65T) 

A(1-31)IE41= plasmid 3 5H-A ( 1-31 ) IE41-sGFP ( S65T) 
15 A(1-59)IE41= plasmid 35Q-A ( 1-59 ) IE41-sGFP (S65T) 

A(1-99)IE41= plasmid 35Q-A ( 1-99 ) IE41-sGFP ( S65T) 
(6-100) IE41= plasmid 35Q- ( 6-100 ) IE41-sGFP (S65T) 
(60-100) IE41= plasmid 3 5Q- ( 60-100 ) IE41-sGFP ( S65T) 

Bombardment of Arabidopsis and tobacco cells 

2 0 The Arabidopsis cells are cultured in light for 3 

days in GAMBORG's B5 medium (SIGMA, pH 5.8) supplemented with 
1.5% sucrose and 1 |aM ANA (naphthaleneacetic acid). 15 ml of 
cell suspension (corresponding to approximately 0.5 g) are 
transferred into Petri dishes containing the same growth 
25 medium to which 0.8% bacto-agar has been added, and 
incubated for 18 - 36 h in the light. 

BY2 tobacco cells are cultured for 5 days at 27°C 
in MURASHIGE and SKOOG medium (MS medium, DUCHEFA, pH 5.8) 
supplemented with 3% sucrose, 0.2% KH 2 P0 4 , 0.2% myoinositol, 

3 0 1 |iM 2.4D (2 . 4-dichlorophenoxyacetic acid) and 3 jiM thiamine. 

2 . 5 ml of cell suspension (corresponding to approximately 
. 0.3 g) are transferred into Petri dishes containing the same 
growth medium to which 1% bacto-agar has been added, and are 
placed at 27°C for 18 - 24 h. 
35 The plasmids comprising the test constructs used 

for the tissue bombardment are prepared using the "QIAfilter 
Plasmid Midi Kit" (Qiagen, Germany) . 

The plasmid [35Q-sGFP (S65T) ] (GFP) and the 
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plasmid [ 35Q-TP-SGFP (S65T) ] (TP-GFP) are respectively used 
as a negative control and as a positive control. 

The plasmids (1 \xg) are introduced into the cells 
using a pneumatic particle gun (PDS-1000/He, BIORAD) . The 
5 bombardment conditions are as follows: helium pressure of 
1350 psi; 1100 psi rupture disks (BIORAD); 10 cm target 
distance; 1 |im gold microcarriers (BIORAD) are used. After 
bombardment, the cells are incubated on these same Petri 
dishes for 18-36 h (in the light for the Arabidopsis cells) , 
10 and then transferred onto glass slides before fluorescence 
microscopy. 

Fluorescence microscopy 

The location of the GFP and of the GFP-fusion 
peptides in analyzed in the transformed cells by fluorescence 
15 microscopy using a ZEISS AXIOPLAN 2 fluorescence microscope 
- and a digital CCD camera (HAMAMATSU) . The sets of filters 
used are: Zeiss filterset 13, 488013-0000 (exciter BP 470/20, 
beam splitter FT 493, emitter BP 505-530), and Zeiss filter 
set 15, 488015-0000 (exciter BP 546/12, beam splitter FT 580, 

2 0 emitter LP 590) for the GFP and the chlorophyll 

autof luorescence , respectively . 

Under these conditions, the presence of 
chlorophyll (specifically located in the chloroplasts ) and 
the location of the GFP in the cell are visualized by virtue 
25 of an intense fluorescence. 

In the Arabidopsis cells transformed with the 
constructs GFP, A(1-99)IE41, and ( 60-100 ) IE41 , the GFP 
fluorescence appears to be diffuse and located in the cytosol 
and the nucleus; no co-localization with chlorophyll is 

3 0 observed. 

In the Arabidopsis cells transformed with the 
. constructs IE41, A(1-31)IE41, A(1-59)IE41 and (6-100)IE41, 
and also with the positive control for localization TP-GFP, a 
co-localization is, on the other hand observed in the 
3 5 chloroplasts, between the GFP fluorescence and the 
chlorophyll autof luorescence . 

The results are similar in the nonchlorophyll- 
containing BY2 tobacco cells: the fluorescent labelings 
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observed with the constructs IE41, A(1-59)IE41 and 
(6-100) IE41, and with the positive control for localization 
TP-GFP, correspond to a plastid localization; on the other 
hand, the fluorescent labelings observed with the constructs 
5 GFP, A(1-99)IE41, and (60-100) IE41 correspond to a cytosolic 
and nuclear localization. 

These experiments show that the targeting is also 
effective in nonchlorophyll-containing plasts. 

All the results above show that: 
10 - the complete IE41 protein fused to GFP is 

targeted into the chloroplast; 

- the 59 residues located at the N- terminal are 
not essential to the importation; 

the 99 residues located at the N-terminal 
15 contain a region essential to the importation; 

- a sequence of 94 residues, corresponding to N- 
terminal amino acids 6-100, is sufficient to catalyze the 
importation; the 223 C-terminal residues (101-323) are 
therefore not essential to the importation. 

2 0 The internal sequence of 40 amino acids, ranging 

from residues 60-100, correspond to the domain that is 
essential for importation. However, this domain, which must 
be present in order to direct the protein to the plasts, is 
not sufficient for correct targeting. 

2 5 EXAMPLE 6 : IN PLANTA ANALYSIS OF THE PLASTID- TARGETING OF 
THE IE41 PROTEIN 

The plasmids 35Q-IE41-sGFP (S65T) , 35Q-A(1- 
31) IE41-sGFP(S65T) , 35Q-A ( 1-59 ) IE41-SGFP (S65T) , 35Q-A(1- 
99) IE41-sGFP(S65T) , 35Q- ( 6-100 ) IE41-SGFP (S65T) , and 3 5Q-(60- 
30 100) !E41-sGFP(S65T) , and also the control plasmids 
35Q-sGFP(S65T) and 35Q-TP-sGFP ( S65T) , were digested with 
. EcoRI/Hindlll in order to recover the expression cassettes. 

These cassettes were inserted into the binary 
plasmid pELl03 (derived from the plasmid pBI121 (AF485783), 
35 containing a kanamycin-resistance gene) , and the resulting 
plasmid was used to transform the Agrobacterium tumefaciens 
strain C58 by electroporation . The transformed bacteria were 
used to transform Arabidopsis WS plants by the "floral dip" 
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technique (The Plant Journal 1998; 16: 735-743). The 
transgenic plants are selected on the basis of their 
kanamycin resistance . 

In order to analyze the expression of the fusion 
5 proteins in the transgenic plants, the total proteins are 
extracted from 10 mg of leaves of each of the tested plants, 
and solubilized in the following buffer: tetrasodium pyro- 
phosphate (13.4 g/1) , Tris-HCl pH 6.8 (50 mM) , SDS (1 %) . 

The protein extract is analyzed by SDS-PAGE (12% 
10 acrylamide) , and by Western blotting using an anti-GFP 
antibody (antibody 2A5 (Euromedex) diluted 1/4000 in TBST/5% 
milk; secondary antibody: alkaline phosphatase- con jugated 
anti-mouse IgG (Promega) diluted 1/10 000 in TBS-Triton) , or 
the rabbit anti-IE41 polyclonal antibodies described in 
15 example 1 (diluted 1/5000 in TBS-Triton/ 5% milk; secondary 
antibody: alkaline phosphatase-con jugated anti-rabbit IgG 
(Promega) diluted 1/10 000 in TBS-Triton buffer) . 

The results of these analyses are illustrated in 

figure 9; 
20 Legend to figure 9: 

- A : SDS-PAGE analysis ; 

- B : Western blotting with the anti-GFP antibody: the black 
arrows indicate the presence of the GFP protein in the 
fusions expressed in Arabidopsis ; 

25 - C : Western blotting with an anti-IE41 antibody: the black 
arrows indicate the presence of the IE41 protein in the 
fusions expressed in Arabidopsis ; the white diamond indicates 
the position of the natural IE41 protein present in all the 
extracts ; 

3 0 - WT = non- transformed plant; 

- M = molecular weight markers; 

- GFP = plasmid 35Q-SGFP (S65T) ; 

- TP GFP = plasmid 35Q-TP-sGFP ( S65T) ; 

- IE41 GFP = plasmid 35Q-IE41-SGFP (S65T) ; 

35 -A(1-59)IE41 GFP = plasmid 35Q-A ( 1-59 ) IE41-SGFP ( S65T) ; 
-A(1-99)IE41 GFP = plasmid 35Q-A ( 1-99 ) IE41-SGFP ( S65T) ; 

- (6-100) IE41 GFP = plasmid 35Q- ( 6-100 ) IE41-SGFP (S65T) ; 

- (60-100) IE41 GFP = plasmid 3 5Q- ( 60-100 ) IE41-sGFP ( S65T) . 
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These results show that the fusion proteins are 
expressed in all the transformed plants. 

The subcellular location of the proteins 
expressed by the various constructs was visualized by 
5 fluorescence microscopy, as described in example 5 above. 

The results are given in figure 10; 
Legend of figure 10: 

- GFP = plasmid 35Q-SGFP (S65T) ; 

- TP-RBCS GFP = plasmid 35Q-TP-SGFP ( S65T) ; 
10 - IE41 GFP = plasmid 35Q-IE41-SGFP (S65T) ; 

- (6-100)IE41 GFP = plasmid 35Q- ( 6-100 ) IE41-sGFP (S65T) . 

It appears that, under the expression conditions 
used above, it is the ^-terminal region of the IE41 protein 
(residues 6 to 100) which confers the greatest specificity 

15 of targeting to the plast. In fact, the construct 
( 6-100 ) IE41-GFP, which expresses only this region, allows 
the systematic targeting of all the fluorescence to the 
plasts. On the other hand, the complete IE41 protein 
(construct IE41-GFP) induces a less specific plastid- 

20 targeting. Under these conditions, the IE41 protein also 
appears to be targeted to other intracellular compartments. 



